Mapping Phrase Structures to Dependency Structures in the Case of (Partially) Free Word Order Languages

Author

  • Bernd Bohnet
Abstract

Corpora are very useful for many tasks in natural language processing (NLP). Syntactically annotated corpora have become an important NLP resource. They are commonly used, for example, as a test bed for generation, parsing, and semantic disambiguation, and as a source for acquiring resources (collocations, subcategorization information, grammar extraction). When dependency structures are used for NLP, the lack of corpora annotated with dependency structures is a handicap. We present an approach based on a graph grammar for converting corpora annotated with phrase structures into corpora annotated with dependencies. The approach works for languages with (partially) free word order as well as for languages with fixed word order.
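As a rough illustration of what converting a phrase structure into a dependency structure involves, the sketch below uses a simple head-rule (head-percolation) conversion. This is a commonly used generic technique, not the graph-grammar approach of the paper. The `Node` class, the `HEAD_RULES` table, the category labels, and the example sentence are all assumptions made for illustration. The sketch also assumes continuous constituents and unique word forms, so it does not address the discontinuities that arise with free word order.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    label: str                       # phrase category or part-of-speech tag
    word: Optional[str] = None       # set only on leaf nodes
    children: List["Node"] = field(default_factory=list)


# Hypothetical head table: for each phrase category, the child labels
# to try as head, in priority order.
HEAD_RULES = {
    "S": ["VP", "NP"],
    "VP": ["V", "VP"],
    "NP": ["N", "NP"],
}


def lexical_head(node: Node, arcs: List[Tuple[str, str]]) -> str:
    """Return the head word of `node`, appending (head, dependent) arcs."""
    if node.word is not None:
        return node.word
    heads = [lexical_head(child, arcs) for child in node.children]
    labels = [child.label for child in node.children]
    head_index = 0  # fall back to the first child if no rule matches
    for wanted in HEAD_RULES.get(node.label, []):
        if wanted in labels:
            head_index = labels.index(wanted)
            break
    # Every non-head child's head word depends on the head child's head word.
    for i, head_word in enumerate(heads):
        if i != head_index:
            arcs.append((heads[head_index], head_word))
    return heads[head_index]


def to_dependencies(root: Node) -> Tuple[str, List[Tuple[str, str]]]:
    """Return the root word and the list of (head, dependent) arcs."""
    arcs: List[Tuple[str, str]] = []
    top = lexical_head(root, arcs)
    return top, arcs


# Illustrative tree for "the dog sees a cat".
example = Node("S", children=[
    Node("NP", children=[Node("Det", "the"), Node("N", "dog")]),
    Node("VP", children=[
        Node("V", "sees"),
        Node("NP", children=[Node("Det", "a"), Node("N", "cat")]),
    ]),
])
```

Calling `to_dependencies(example)` returns the root word together with a list of head-dependent pairs. A real converter would also need dependency labels and token indices rather than bare word forms.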

Similar resources

Feature Engineering in Persian Dependency Parser

A dependency parser is one of the most important fundamental tools in natural language processing; it extracts the structure of sentences and determines the relations between words based on dependency grammar. Dependency parsing is well suited to free word order languages such as Persian. In this paper, a data-driven dependency parser has been developed with the help of a phrase-structure parser fo...

Converting Phrase Structures to Dependency Structures in Sanskrit

Two annotation schemes for presenting parsed structures are prevalent, viz. the constituency structure and the dependency structure. While constituency trees mark relations due to position, dependency relations mark semantic dependencies. Free word order languages like Sanskrit pose more problems for constituency parses since the elements within a phrase are dislocated. In ...

Dependency-Based Hybrid Model of Syntactic Analysis for the Languages with a Rather Free Word Order

Although phrase structure grammars have turned out to be a more popular approach for analysis and representation of the natural language syntactic structures, dependency grammars are often considered as being more appropriate for free word order languages. While building a parser for Latvian, a language with a rather free word order, we found (similarly to TIGER project for German and Talbanken...

The Effect of Morphology on Persian Dependency Parsing

Data-driven systems can be adapted to different languages and domains easily. This trend led to the introduction of data-driven approaches in dependency parsing. The existence of appropriate corpora that contain sentences and their associated dependency trees is the only prerequisite for data-driven approaches. Despite obtaining highly accurate results for the dependency parsing task in English langu...

TAG and Topology

Classical phrase structure tries to collapse syntactic and ordering information. However, this conception of the syntax of language is erroneous because it supposes that word order is always an immediate reflection of the syntactic hierarchy and that any deviation from this constitutes a problem, denoted by terms with negative undertones like scrambling. Modern linguistic frameworks propose a d...

Publication date: 2007